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ABSTRACT 

We present a technique which employs artificial neural networks to produce physical 
parameters for stellar spectra. A neural network is trained on a set of synthetic optical 
stellar spectra to give physical parameters (e.g. Tcft, logg, [M/H]). The network is then 
used to produce physical parameters for real, observed spectra. 

Our neural networks are trained on a set of 155 synthetic spectra, generated 
using the spectrum program written by Gray (Gray & Corbally 1994, Gray & Arlt 
1996). Once trained, the neural network is used to yield TcS for over 5000 B-K spectra 
extracted from a set of photographic objective prism plates (Bailer- Jones, Irwin & von 
Hippel 1997a). Using the MK classifications for these spectra assigned by Houk (1975, 
1978, 1982, 1988), we have produced a temperature calibration of the MK system based 
on this set of 5000 spectra. It is demonstrated through the metallicity dependence 
of the derived temperature calibration that the neural networks are sensitive to the 
metallicity signature in the real spectra. With further work it is likely that neural 
networks will be able to yield reliable metallicity measurements for stellar spectra. 

Key words: methods: data analysis, numerical - stars: fundamental parameters 



1 INTRODUCTION 

The MK classification system was first proposed in its cur- 
rent form in 1943 by Morgan, Keenan & Kellman (1943), and 
has since undergone a number of revisions (e.g. Keenan & 
McNeil (1976), Morgan, Abt & Tapscott (1978)). MK clas- 
sification is the only widely used system for stellar spectral 
classification. Over its history it has contributed towards a 
number of important developments in astronomy, such as 
the further development of the now-famous HR diagram 
(Hertzprung 1911, Russell 1914) and the identification of 
anomalous stars. Currently, MK classification is largely used 
as a tool in the preliminary analysis of unusual stars, and in 
selecting stellar samples for further study. 

An often-stated advantage of the MK system is that 
its classifications, often based upon the visual inspection 
of spectra, are static because they are based on a set of 
standards. However, a given spectrum may be classified 
differently by different people, and any one person may 
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also classify a given spectrum differently at different times. 
These problems of subjectivity could be partially alleviated 
through the use of automated classifiers (von Hippel et al. 
1994, Bailer- Jones et al. 1997a). Automated classifiers could 
also produce quantitative errors associated with their clas- 
sifications. Another problem with the MK system is that it 
lacks a well-defined metallicity parameter, whereas metalli- 
city variations are known to have a significant effect on the 
appearance of high (~ 1 A) resolution spectra. This limits 
the system to classifications of bright, nearby stars which 
show only limited metallicity variations. 

Attempts to extend and revise MK classification (e.g. 
Corbally, Gray & Garrison 1994) may well prove valuable, 
but as our understanding of stellar spectra grows, particu- 
larly from computational work with model atmospheres and 
synthetic spectra, it becomes increasingly desirable to ob- 
tain reliable physical parameterizations of stars. Any classi- 
fication system is a compromise between retaining the full 
information in the spectrum and the need for a compact 
summary of it. The optimal 'summary' is of course given by 
the physical parameters. Advances in computational power 
and data storage since the inception of MK classification 
mean that it is now practicable to process and store large 
numbers of spectra. The development of fast, automated 
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classifiers will mean that it is feasible to 'classify' large num- 
bers of stellar spectra in terms of their physical parameters 
and to re-classify them rapidly whenever stellar models are 
improved. Physical parameters should be obtained from an 
original spectrum, rather than an empirical classification, as 
the latter may well disregard certain spectral features which 
later turn out to be important. One of the advantages of the 
MK classification system is that it is an empirical system 
based on unchanging standards, whereas any direct param- 
eterization of a spectrum depends upon the quality of stellar 
models and will change as these improve. MK classifications 
could remain as useful labels giving a rough 'feel' for a spec- 
trum, and the work of von Hippel et al. (1994) and Bailer- 
Jones et al. (1997a) has shown that reliable automated MK 
classification is possible. In this paper we demonstrate how 
we extend our automated techniques to the determination 
of physical parameters directly from an observed spectrum. 



2 OBSERVATIONAL DATA 

The observational stellar spectra used in this project were 
taken from objective prism plates obtained in the Michigan 
Spectral Survey (Houk 1994). This was an objective prism 
survey of the whole southern sky (5 < -1-12°) from the Curtis 
Schmidt telescope at the Cerro Tololo Interamerican Obser- 
vatory in Chile. We were loaned 100 of these plates by Houk, 
which we digitized using the APM facility in Cambridge. 
The output from the APM is an optical density. Cawson 
et al. (1987) showed that the optical density calculated by 
the APM is, to a good approximation, linearly related to the 
incident intensity on the photographic plate for the typical 
range of optical density encountered for this type of extrac- 
tion. Once scanned, the digitized plates were reduced and 
the spectra extracted off-line. This yielded a set of over 5000 
spectra over the approximate spectral range 3800-5200 A at 
a resolution of ~3A. This data set covers a wide range of 
spectral types (B2-M7) for luminosity classes III, IV and 
V. The Michigan Spectral Survey was designed to be a re- 
classification of the Henry Draper stars. In keeping with pre- 
vious usage, we shall refer to our set of 5000 spectra from 
this survey as the MHD (Michigan Henry Draper) spectra. 
Further details of this data set and the spectral extraction 
technique are discussed in Bailer- Jones, Irwin & von Hippel 
(1997b). These spectra have been used in a related project 
to automate MK stellar classification using a neural network 
(Bailer- Jones et al. 1997a). 



3 NEURAL NETWORKS 

A neural network is a software device which can be trained 
to give a non-linear parameterized mapping between a num- 
ber of inputs (e.g. a complete stellar spectrum) and one or 
more outputs (e.g. parameters such as T^s or MK spectral 
type) . Figure nl shows a neural network architecture with a 
single output. Each node in the input layer holds a value, U. 
In our application the vector of inputs, (/i, /2, . . . , /i, . . .), is 
a stellar spectrum and the output is the effective tempera- 
ture, Teff. Each of the input nodes connects to every node in 
the next layer of nodes, the 'hidden' layer, and each of these 
connections has a weight, Wij, associated with it. The j*'' 



Input Layer ^ Hidden Layer ^ Output Layer 




Figure 1. Neural network architecture. A neural network con- 
sists of layers of nodes linked by weighted connections (weights) . 
Each node in the hidden and output layers performs a non-linear 
weighted sum of its inputs which it then passes to its output. 
Therefore the final output from the neural network is a non-linear 
function of the network inputs parameterized by the weights. The 
weights are found by 'training' the neural network using a set 
of inputs for which the ideal outputs (the 'target outputs') are 
known. This training process is therefore equivalent to interpo- 
lating the training data to find the underlying function relating 
the inputs to the output. 



node in the hidden layer forms a weighted sum of its inputs, 
given by Xj — ^ Wijh. It then passes this sum through 
a non-linear sigmoid transfer function to give the final out- 
put from this node, Hj = [1 + e~^^^)~^, where A is some 
constant. These outputs from the nodes in the hidden layer 
then serve as the inputs to the node in the output layer, 
which performs the same processing. The output from this 
final node is the network output. The non-linearity of the 
sigmoid function means that the network output is a non- 
linear function of the inputs. It can be shown that neural 
networks can model functions of arbitrary complexity (e.g. 
Bishop (1995) and references therein). Neural networks are 
therefore useful in a wide range of data modelling applica- 
tions. 

The key to producing the desired output from the net- 
work when it is presented with a certain input is to set the 
network parameters - the weights - to their correct val- 
ues. This is known as 'training' the network and requires 
a set of input data for which the associated ideal outputs 
(the 'target outputs') are known. Training takes place as 
follows. The weights are initially set with random values 
over a small range. Thus when a spectrum is fed into the 
network, the output will also be random. By comparing this 
output with what it should be (the target output), we can 
adjust the weights to give an output which is closer to the 
target value. This is repeated for each input/output pair in 
the training data set. The network is trained iteratively by 
successive passes of the training data through the network, 
and on each pass the weights are perturbed towards their op- 
timal values. Specifically, the network training is performed 
by minimizing the error function E — '^^{Ok — Tfc)^, where 
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Ok is the network output for a given input spectrum and Tk 
is the target output for that spectrum. Thus we can think 
of training a neural network as an A^- dimensional minimiza- 
tion problem in which we want to find those values of the TV 
network weights which minimize E. Once the weights have 
been found they are fixed and the neural network can be 
applied to any number of new inputs for which the outputs 
are not known. Training the neural network is simply the 
process of interpolating the multi-dimensional training data 
in order to produce an input-output mapping characteristic 
of the problem represented by these data. 

The output from a neural network is some non-linear 
function of all of the network inputs. In our particular ap- 
plication this means that the values of physical stellar pa- 
rameters which the neural network gives at its output are 
based on the appearance of the whole spectrum: we do not 
have to tell the network in advance which spectral lines are 
relevant. Based only upon the training data, the neural net- 
work will learn which wavelengths are more significant than 
others in determining the correct spectral parameters, and 
will express this by assigning appropriate values to the net- 
work weights. 

Neural networks have been applied in a number of ar- 
eas of astronomy, as reviewed by Storrie-Lombardi & Lahav 
(1994) and Miller (1993). Further theoretical details of neu- 
ral networks can be found in, for example. Hertz, Krogh 
& Palmer (1991), Bishop (1995), Lahav et al. (1996) and 
Bailer- Jones (1996). 



4 CALIBRATION PROCEDURE 

In the following sections, theoretical stellar spectra of given 
physical parameters (such as T^g and logg) are generated 
and processed to the same 'fiux' and dispersion scales as the 
observed MHD spectra described in the previous section. A 
neural network is trained on these synthetic spectra with 
their physical parameters as the network target outputs. 
Once trained, the neural network is used to give physical 
parameters for the MHD stars. This provides a means of 
rapidly and easily obtaining physical parameters for a large 
number of spectra without calibration via the MK system. 
However, a statistical comparison of these physical param- 
eters with the known MK classifications of the MHD stars 
provides a calibration between the MK system and physical 
stellar parameters. Training the network is very quick (~ 10 
seconds on a SUN Sparc 10), and applying it is even faster, 
so it is not prohibitive to re-train the network as improved 
models are obtained. 



5 SYNTHETIC SPECTRA 
5.1 Generation 

The synthetic spectra were created using the spectrum pro- 
gram written by Gray (Gray & Corbally 1994, Gray & Arlt 
1996). This program computes a synthetic spectrum un- 
der the assumption of Local Thermodynamic Equilibrium 
(LTE) using a model stellar atmosphere. We used the fully 
blanketed models calculated by Kurucz (1979, 1992). The 
model atmosphere is a tabulation of temperature and pres- 
sure at a range of 'mass depths' in the stellar photosphere. 



calculated on the basis of a variety of sources of opacity. 
These model atmospheres are characterized by four param- 
eters: metallicity, [M/H]; microturbulence velocity, Vmicro; 
surface gravity, logg; effective temperature, Tctt- Each spec- 
trum is uniquely labelled by these four parameters. To cal- 
culate the spectrum emergent from the model atmosphere, 
SPECTRUM also requires a table of atomic and molecular 
species which lists their relative abundances, masses, and 
ionization energies (or disassociation energies for molecules) . 
SPECTRUM is also given a line list, listing, for each line in the 
spectrum, the atomic or ionic species producing the line, 
the energy of the two bound electron energy levels involved 
in the transition, the oscillator strength of the transition 
and a damping factor. From these, spectrum calculates the 
densities of electrons, atoms and ions at different layers in 
the atmosphere, from which it determines the atmospheric 
opacity as a function of optical depth and wavelength us- 
ing a number of different opacity sources (Rayleigh scat- 
tering, electron scattering, bound-free opacities etc.). Under 
the assumption of LTE, the source function, 5*1,, is equal to 
the Plank function, and the latter can be computed directly 
from the temperatures tabulated in the model atmosphere. 
Using the source function and opacities, spectrum calcu- 
lates the synthetic spectrum in small wavelength portions. 
This is done by calculating the continuum and line absorp- 
tion strengths at each wavelength and evaluating a Voigt 
broadening profile for the lines using van der Waals, natural, 
and quadratic Stark broadening. More details of spectrum 
can be found in Gray & Corbally (1994). 

Table |l| lists the spectra which we generated. The logg 
= 4.0 and logg = 4.5 models are dwarfs and the remain- 
der correspond to giants or subgiants. The spectra in Ta- 
ble ^ were calculated for each of four metallicities, [M/H] 
= 0.0, —0.2, —0.5, —1.0, giving a total of 155 spectra at 
each metallicity. A microturbulence velocity of Vmicro ~ 
2.0kms~^ was used throughout, spectrum creates good 
quality spectra over a fairly wide range of spectral types, 
but it does not produce very reliable spectra for very early- 
or very late-type stars. The generation of accurate spectra 
at the hot end is inhibited primarily by the assumption of 
LTE, as NLTE effects are important in O-type stars. This 
should not have any significant effect on the calibrations of 
the MHD spectra, as the earliest type stars present are B2. 
At the other end of the temperature scale, spectra cooler 
than about 4250 K (approximately K7 and later for dwarfs, 
and K3 and later for giants) are not accurately synthesized 
on account of the absence of many important molecules (e.g. 
H2O) from Kurucz's models, and on account of the fact that 
SPECTRUM does not include TiO in its opacity calculations 
(although it does include MgH, C2, NH, CH, CN and SiH) 
(R. Gray, private communication, 1996). The quality of the 
spectra will also start to decrease for stars later than about 
G2 (solar), because in these stars a significant part of the line 
formation occurs in the chromosphere, which is not modelled 
by SPECTRUM. 

5.2 Processing 

The synthetic spectrum generated by spectrum is an energy 
fiux spectrum evaluated at 0.02 A intervals. If a neural net- 
work is to be trained on a set of synthetic spectra and used 
to get physical parameters for real observed spectra, then 
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Table 1 . Synthetic spectra generated and used for network train- 
ing. A spectrum was generated at each of the four metallicities, 
[M/H] = 0.0, -0.2, -0.5 and -1.0, yielding a set of 155 spectra 
per metalhcity. The microturbulence velocity, Vmicroi wa^s fixed 
at 2.0kms~^. The temperature steps above are the finest steps 
in which Kurucz's model atmospheres are available. The gaps at 
high temperature represent unstable model atmospheres which 
blow apart due to high radiative fluxes (Kurucz 1992). SPECTRUM 
takes about one hour (on a SUN Sparc 10) to calculate a synthetic 
spectrum in 0.02 A steps over the range 3790A-5200A. 
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the observed and synthetic spectra must be processed into 
a homogenous form. Specifically, they must have common 
wavelength and flux scales. Because we are free to generate 
the synthetic spectra at high resolution and infinite S/N, we 
chose to process these into the format of the MHD spectra, 
rather than vice versa. The processing steps are summarized 
in Figure ^ 

When non-linear processing operations are involved, the 
order of the operations in this processing is relevant. It 
should correspond to the effective processing performed by 
the telescope, disperser and detector in obtaining a real spec- 
trum. The first stage of processing the synthetic spectra was 
therefore to re-bin them (conserving flux in the process) to 
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Figure 2. Processing a synthetic spectrum, (a) A synthetic spec- 
trum as calculated by SPECTRUM, with parameters [M/H] = 0.0, 
log g = 4.0, Tgfl = 9000, shown sampled at 0.1 A on a linear wave- 
length scale. The flux is in units of erg cm~^ . (b) The 
spectrum is re-binned to the dispersion of the MHD spectra and 
area normalized, (c) The spectrum is blurred with a cr = 0.88 pix 
(= 3" FWHM 'seeing') Gaussian filter, (d) The continuum is re- 
moved, (e) A bright (mag = 8.1) MHD spectrum (HD 23164) of 
spectral class Al V, which has similar physical parameters to the 
synthetic spectrum shown in (d). 

the dispersion of the MHD spectra. This required the wave- 
length calibration of the MHD spectra. We obtained this 
from the average spectrum of the MHD data set to give a 
calibration which averaged out any slight wavelength mis- 
alignments incurred when reducing the objective prism spec- 
tra. A third-order fit to 16 identified lines in the spectrum 
gave an RMS wavelength-pixel calibration error of < 0.4 A. 
As the synthetic spectra are over-sampled relative to the real 
spectra by a factor of at least 50 {— 1.05 Kpix~^ A), we 
were justified in linearly interpolating the synthetic spectra 
across their 0.02 A-wide bins when re-binning them to the 
MHD flux bins. This re-binning yields a spectrum such as 
that shown in Figure Note that the apparent number 
and strength of the metal lines below some equivalent width 
is greatly reduced. 

The next step was to 'blur' the spectra to simulate at- 
mospheric turbulence ('seeing') and telescope tracking er- 
rors. This blurring was achieved by convolving a Gaussian 
(truncated at ±3cr) with the spectrum. This Gaussian was 
normalized to conserve flux. A smooth blurring (and flux 
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conservation) at the end of the spectrum was obtained by 
reflecting the spectrum about its ends prior to the convo- 
lution. We experimented with a number of different values 
of the a parameter to obtain a best match between the ap- 
pearance of a blurred synthetic spectrum and an MHD spec- 
trum of similar physical properties (as inferred from the MK 
classification and visual inspection). Given that each bin in 
an MHD spectrum corresponds to 1.45", a FWHM (2.35cr) 
seeing of 6 arcseconds corresponds to cr = 0.29 x 9pix~^ 
arcseconds. We found that the optimal blurring was about 
3" (0.88 pixels), which seems reasonable given the relatively 
poor tracking ability of the Michigan Curtis Schmidt tele- 
scope. 

The final stage of preprocessing was continuum removal. 
This was done in exactly the same way as for the MHD spec- 
tra: a spectral continuum is produced by median and then 
boxcar filtering the spectrum. This continuum is subtracted 
from the original spectrum to yield a rectified (line-only) 
spectrum. To improve the continuum fit in the region of 
broad lines (most importantly H lines, the CN band and Ca 
II H&K lines), these regions were masked off prior to the 
median filtering. Further details are given in Bailer- Jones 
et al. (1997b). 

The final spectrum is shown in Figure ^d above a real 
spectrum of similar physical parameters. The removal of the 
continuum from the synthetic spectra gives a good line-only 
spectrum everywhere apart from at the very blue end. This 
was not apparent in the continuum removal of the MHD 
spectra as the plate QE is very low at the blue end of the 
spectrum. To improve the match between the spectra, we 
chose to exclude the first ten fiux bins from all synthetic 
and MHD spectra when using them together in a neural 
network. The final spectra therefore consist of 810 fiux bins. 



6 NEURAL NETWORK CALIBRATION 
6.1 Teff Calibration 

To determine effective temperature, Tctt, for the MHD spec- 
tra, a neural network was set-up with a single continuous 
output to represent T^s- The type of neural network used 
had an output range confined to 0-1. In order to fit the 
4000 K to 30 000 K temperature range of the synthetic mod- 
els comfortably into this range, we used the transformation 
T — (log Tcff— 3)/2, where T is the network target. A more 
straightforward implementation would be to use a network 
with linear outputs which has an unbounded output range, 
although we would probably still want to use log TeS as 
the target in order to reduce the dynamic range. The exact 
transformation used should not matter. 

Following on from the work of Bailer- Jones et al. 
(1997a), we used an 810:5:5:1 neural network architecture 
for this calibration problem. This refers to a neural network 
with 810 inputs (the number of fiux values in the spectrum), 
5 nodes in each of two hidden layers (Figure ^ shows a net- 
work with only a single hidden layer) and a single output. 

Rather than using a single neural network to give pre- 
dictions of Teff, we used a committee of ten neural networks 
differing only in their initial random weights. The ouputs 
given by each member of the committee are averaged to give 
a single prediction which is generally more reliable than that 
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Figure 3. Teg-SpT relationship for the MHD spectra from a 
committee of neural networks trained on [M/H] = —0.2 syn- 
thetic spectra. The relationships for luminosity classes III, IV 
and V are shown separately, although the values of T^g for these 
were obtained from the same committee. The lines show calibra- 
tions from the literature for comparison. The solid line is the 
calibration of Gray and Corbally (1994); the dashed line is that 
of Schmidt-Kaler (1982); the dotted line is that of Gray (1992). 
Gray and Corbally only give a calibration of dwarf (V) stars, but 
this calibration line has been included in the top two plots to ease 
comparison between the plots of the neural network calibrations. 
Schmidt-Kaler and Gray both give separate calibrations for giants 
(III). None give calibrations for the subgiants (IV). 



given by any single network. Four training data sets were 
constructed, one for each of the four metallicities at which 
synthetic spectra were generated: [M/H] = 0.0, —0.2, —0.5 
and —1.0. Thus each training set consisted of 155 spectra of 
the same metallicity and with values of T^b and log g shown 
in Table ^ A separate committee was trained on each of the 
four data sets. Once trained, the committees were used to 
evaluate TcS for all of the MHD spectra. Figure ^ shows the 
Tcff-SpT calibration produced by the committee trained on 
the [M/H] = —0.2 spectra, over-plotted by calibrations from 
the literature. 

The MK spectral type classifications along the a;-axis 
are the catalogue classifications of the MHD spectra as given 
by Houk (1975, 1978, 1982, 1988), converted into codes on a 
1-57 numerical scale as shown in Table ^. These classifica- 
tions are all either integral or half-integral spectral subtypes, 
so the data are discrete along the a;-axis. Because the syn- 
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committee of neural networks trained on [M/H] = —1.0 synthetic 
spectra. See caption to Figure ^ for further details. 



thetic spectra are only valid for temperatures above about 
4250 K, we would not expect the neural networks to give re- 
liable calibrations for spectra later than about K5, and we 
have omitted all spectra later than K5 from these calibration 
plots. 

Apart from a handful of outliers, the majority of the 
2732 dwarfs in the bottom panel of Figure ^ show a rea- 
sonably 'tight' correlation between MK spectral type and 
Teff. Their distribution also agrees quite well with the pub- 
lished calibrations. The giant stars also show a reasonable 
Teff-SpT correlation, although it deteriorates towards cooler 
stars, most probably due to the lower quality and accuracy 
of the synthetic spectra in this region. We have not discov- 
ered any Tcft-SpT calibrations in the literature with which 
to compare the subgiant calibrations, but our data shows a 
reasonably tight correlation between TcS and spectral type. 

As a comparison. Figure ^ shows a similar plot to Fig- 
ure ^ for a committee of ten networks trained on the [M/H] 
= —1.0 spectra. Here we see a slightly tighter correlation for 
the dwarfs, but the agreement with the published calibra- 
tions is not as good. The correlations for networks trained 
on [M/H] = —0.5 and [M/H] = 0.0 spectra were also quite 
tight, but they did not agree with the literature calibrations 
for both giants and dwarfs as well as [M/H] = —0.2. Metalli- 
city effects on the calibration are discussed in section 
below. 



6.2 



The data in Figure^ can be used to give a statistical cal- 
ibration between Teff and spectral type. The most suitable 
way of achieving this is by forming the frequency distribu- 
tion, DsiTcs), of Teff for each spectral type separately, where 
s indicates that D is different for each spectral type. With a 
sufficiently large amount of data, Ds{Tes) would be approx- 
imately Gaussian. The median of Ds(Teff), Teff, is a robust 
measure of the Toff calibration for this spectral type. An ap- 
propriate measure of the spread of -Da (Teff ) is cres- This is 
the value which confines 68% of the data, and is equal to 
la if the distribution is Gaussian. For some spectral types 
there were only a few spectra; in these cases it was necessary 
to linearly interpolate Ds(Tcfr) in order to yield an accurate 
measure of o-gg- 

Figure ^ plots these calibrations and their cres errors. 
It shows a very good agreement with the published cali- 
brations, particularly for the dwarf stars. At the ends of the 
spectral type scale the calculated errors become significantly 
larger. This could be partly on account of the increased diffi- 
culty of the neural network to interpolate its mapping func- 
tion at the ends of the output range, as the interpolation is 
less well constrained in these regions. The larger errors at 
the cool end are more likely, however, to be attributable to 
the lower quality of the stellar models and synthetic spectra 
at low temperatures. The larger errors at high temperatures, 
SpT ^ 20, are probably an indication of a poorer determina- 
tion of the Toff-SpT relationship which in turn is the result 
of there being fewer spectra in this region. Table |2| tabulates 
the calibrations and errors in Figure H and shows the num- 
ber of spectra. A'', of each spectral type used to determine 
Teff and a^s- Note that the literature calibrations will also 
have some degree of uncertainty, although as the respective 
authors do not provide details of this it is difficult to perform 
rigorous statistical tests of the level of agreement between 
our calibrations and the literature ones. 

It is important to realise that the MK spectral type 
parameter is not a continuous variable. It is a set of dis- 
crete classes, each of which does not correspond to a single 
unique temperature. There is, therefore, a cosmic scatter of 
temperature about the median calibration value, TcB, and 
this intrinsic scatter will make a significant contribution to 
(768 (other contributions are discussed below). The impor- 
tant point is that aes is not a measure of the error with which 
Teff has been determined. The error in Teff itself is given by 
the standard error in the median, which is smaller than a^s 
by a factor of 0.79VN (= ^2A/7r) for large A^. This error 
for F5 dwarfs, for example, is 280/(0.79^269) = 22 K. Al- 
most all of the values of Ds{TaB) in Table |2| have standard 
errors in the median of less than 50 K. This is about as small 
as can be meaningfully reported. In comparison, Jones, Gi- 
more & Wyse (1996) used a broad band photometric index, 
{V — I)c, to determine temperature. They showed that an 
error in this index of ±0 ™ 05 gives rise to a temperature 
error of 120-240 K (depending upon the spectral type). An 
error of ±0™02 (which is about the limit of the photometry) 
would yield errors of 47-97 K. 

Cosmic scatter is not the only factor contributing to creg • 
Another source of error is the simplifying assumptions used 
in generating the atmospheric models and synthetic spectra, 
notably the assumption of LTE and the absence of certain 
atoms and molecules. Additionally, the synthetic spectra do 
not display variances in other stellar properties which will 
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Figure 5. Teff-SpT calibration of data shown in Figure ^ ([M/H] = -0.2). 
The squares correspond to the median, T^g, of the frequency distribution 
of Tgff for each spectral type. The error bars are the CTes errors, i.e. those 
bounds which include 68% of the data. Those spectral types for which there 
were only one or two points in the distribution are marked with a cross. These 
points are not very reliable and as the error cannot be reliably determined, 
it is not shown. The overplotted lines are the calibrations of Gray and Cor- 
bally (1994) (solid line), Schmidt-Kaler (1982)(dashed line) and Gray (1992) 
(dotted line). We found no published calibration for the subgiants (IV) and 
instead have overplotted the middle figure with the giant (dot-dash line) and 
dwarf (dash line) calibrations of Schmidt-Kaler. Note that the axis scales are 
slightly different here from those used in Figure ^. 



show cosmic variance, such as the relative abundances of 
atomic and molecular species or Vmicro. These will produce 
variations in the MHD spectra which are 'unexpected' by a 
network trained on the synthetic spectra. Another source of 
error is likely to derive from the neglect of line formation 
mechanisms in the chromosphere. 

Whilst we made efforts to process the synthetic spectra 
into the same format as the MHD spectra, they still differ 
in the possibly important respect that the synthetic spec- 
tra are on a linear flux scale, whereas the MHD spectra are 
on a non-linear flux scale. This difference could be removed 
by flux calibrating the photographic plates (e.g. by identi- 
fying spectrophotometric standards on the plates). A final 
problem could be that the synthetic spectra have an unreal- 



istically high S/N. This could lead the networks to lock onto 
certain weak features in the synthetic spectra that cannot 
be used as a source of class discrimination in the lower S /N 
MHD spectra. One way around this would be to add random 
noise to the synthetic spectra and use each synthetic spec- 
trum several times in the training set with different random 
noise added in each case. Nonetheless, the narrow distribu- 
tion of residuals about the median calibrations demonstrate 
that these potential problems have only a small effect on our 
technique and on the resulting calibration shown in Figure ^ 
and Table |. 
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Table 2. Teff-MK spectral type calibrations from a committee of neural networks trained on [M/H] = 
—0.2 spectra. These values are the tabulations of those shown in Figure ^ T^g is the median calibration 
temperature and cres is its error bar shown in Figure ^ Af is the number of stars of each spectral type, and 
hence the number of points used to define the distribution from which T^g and crgg are determined. Note 
that (Tgg measures the spread of this distribution and is not the standard error on the median, which is 
<T68 ^J-k/^N. 
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6.2 Metallicity Effects 

It is apparent from Figures ^ and ^ that, when the metalli- 
city of the synthetic spectra is decreased, the neural net- 
works give systematically lower effective temperatures for 
stars of a given spectral type. The calibration results from 
training neural networks on synthetic spectra of [M/H] = 
0.0 and of [M/H] = —0.5 confirm this trend. An explanation 
of this will be discussed shortly. This is an important re- 
sult because it demonstrates that the network is sensitive to 
metallicity, and that in principle this sensitivity can be ex- 
ploited to give metallicity calibrations for the MHD spectra. 
How this could be done will be discussed later. 

The best agreement between our neural network cali- 
brations and the literature calibrations was obtained when 
the networks were trained on [M/H] = —0.2 spectra. The 
literature calibrations are of course based on the same orig- 



inal MK standard spectra; hence the metallicity at which 
an agreement is achieved tells us something about the mean 
metallicity of the MHD stars and the MK system in general. 
That this comes out to [M/H] = —0.2 is not surprising, as 
it is well known that [M/H] = —0.2 for the local disk (see, 
for example, Wyse & Gilmore, 1995, and references therein 
to earlier work). If the distances to the MHD stars are cal- 
culated and plotted as a histogram, the giants and dwarfs 
show a peak in the distribution at « 400 pc and ~ 100 pc re- 
spectively (Bailer- Jones 1996). For many of the stars, which 
are at high Galactic latitudes, these distances are approxi- 
mately distances out of the Galactic plane. The MHD stars 
are therefore primarily Galactic thin disk stars. The thin 
disk has exponential scale heights of « 100 pc and « 300 pc 
for the young and old stars respectively (Gilmore & Wyse 
1985). Fitting a Gaussian to the distribution of stars plotted 
against [Fe/H], Gilmore and Wyse (1985) reported a mean 
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Figure 6. JVIetallicity effects in the synthetic spectra. The top 
four spectra are Tgff = 10 000 K, log g = 4.0. The bottom four are 
Tgjf = 6250 K, logg = 4.0. In each case metallicity increases down 
the box, with the value of [M/H] shown on the left. The spectra 
have been re-binned to the dispersion of the MHD spectra, but 
have not been blurred. 



abundance [Fe/H] — 0.0 and apc/H] ~ 0.15 for the young 
thin disk, and [Fe/H] = —0.3 and (J[fc/h] = 0.2 for the old 
thin disk. (As spectrum uses standard relative abundances, 
[M/H] ~ [Fe/H] for disk stars.) It is therefore plausible that 
the MHD spectra have an average metallicity of —0.2. It 
is certainly unlikely to be as high as 0.0 or as low as —0.5 
or —1.0. By comparison, the metallicity fit for the Galac- 
tic thick disk (scale height Ikpc) is [Fe/H] = —0.6 and 
""[Fo/H] = 0.3, and for the Galactic extreme spheroid (scale 
height > 4kpc) it is [Fe/H] = —1.5 and crpc/H] = 0.5. 

Why do the Teg calibrations change with metallicity? 
The strength of the metal lines in a synthetic spectrum in- 
creases with increasing metallicity, as can be seen from Fig- 
ure ^ However, at a fixed metallicity, the apparent number 
and strength of the metal lines also generally increases as 
Tcfi decreases. Suppose a network is trained on metal rich 
stars and gives the Tag of a certain MHD star as T. A neu- 
ral network trained on metal weak stars would assign this 
MHD star a temperature lower than T, because this MHD 
spectrum is closer in appearance to a cooler star in the set of 
metal weak spectra. In other words, because the neural net- 
works are trained on synthetic spectra of a single metallicity, 
they are not entirely able to distinguish metallicity features 



from temperature ones when attempting to classify MHD 
spectra. We tried to solve this problem by training a net- 
work on a set of spectra of all four metallicities (a total of 620 
spectra), with the hope that the networks could marginalize 
over metallicity effects. But this gave even poorer calibra- 
tions, both in terms of poorer agreement with the published 
calibration and a poorer Toff-SpT correlation. 

This confusion of temperature effects with metallicity 
ones is not just a problem for neural networks. There is 
a real correlation between metallicity and temperature. At 
lower temperatures there is less excitation and ionization 
of the metals, and hence more metal lines in the spectrum. 
Although the [M/H] value of the MHD star is constant, it 
is difficult for the neural network to isolate those features 
which vary only with metallicity and not with temperature. 
Thus simply training a neural network on whole spectra with 
a range of metallicities is unlikely to help, because, rather 
than permitting the network to 'ignore' metallicity effects, 
this is more likely to mislead it. 

It is clearly essential that metallicity effects are consid- 
ered when attempting to obtain Tctt calibrations for spectra. 
If the metallicity of the observed spectrum could be deter- 
mined first, then it could be used as an extra input to a net- 
work along with the spectrum. If this network were trained 
on synthetic spectra with a range of metallicities, then we 
could imagine that the additional metallicity information 
would permit an appropriate Toff determination. A more 
suitable alternative would be to have a series of networks, 
each trained on spectra of a single metallicity, and then to 
use the network with the appropriate metallicity to evalu- 
ate Tcft : This is more or less what we did earlier by selecting 
that metallicity which gave the best agreement with existing 
Toff-MK cahbrations. The drawback with these approaches 
is that they require the metallicity to be known in advance. 
Jones, Wyse & Gilmore (1995) and Jones et al. (1996) de- 
scribe a method of determining metallicities from relatively 
low S/N ratio (10-20) spectra, although at a higher resolu- 
tion than the MHD spectra. They use spectroscopic indices 
calibrated by synthetic spectra to measure [Fe/H] and avoid 
confusion with temperature by using an independent photo- 
metric measure of Tcft. 

Despite the confusion between temperature and metalli- 
city. Figures |^ and ^ nonetheless show that metallicity pro- 
duces a systematic rather than a random effect on the T^s- 
SpT correlation. In principal, therefore, this metallicity sig- 
nature can be exploited by the neural networks to obtain 
metallicities for the MHD stars. The question is how to do it. 
One solution may to be to determine [M/H] and Toff simul- 
taneously, e.g. by using a neural network with two outputs. 
A direct approach to determining metallicity independently 
of temperature may work by identifying only those spectral 
regions which are most sensitive to metallicity and least sen- 
sitive to temperature, such as certain iron lines in late-type 
stars (Keenan& McNeil 1976). With only these regions as 
inputs to a neural network, the 'noise' caused by tempera- 
ture variations is greatly reduced, making it more likely that 
the neural network could recognise metallicity features. This 
is an area for future work. 
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7 SUMMARY 

We have shown that neural networks trained on syn- 
thetic spectra provide low-error predictions for the effective 
temperature, T^g, of a star based on the star's optical spec- 
trum. By applying these neural networks to spectra with 
exisiting MK classifications, wc have obtained a calibration 
between the MK spectral type parameter and Tcb- This cal- 
ibration shows a good agreement with a number of cali- 
brations from the literature. The calibration was obtained 
statistically from a number of optical spectra at each spec- 
tral type with a resolution of ~ 2 Apix^^. The precision of 
this calibration is largely limited by the cosmic scatter in 
temperature for a given MK class and by limitations of the 
stellar models. It has also been shown that metallicity effects 
have to be considered when trying to determine Teff. Fur- 
ther work is required before neural networks can be used to 
accurately quantify metallicities. In particular, further pro- 
cessing of the synthetic spectra into the format of the MHD 
spectra (e.g. by the addition of noise and calibration of the 
flux scale) may be required. Our work nonetheless demon- 
strates that our networks are sensitive to metallicity. 
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